ARTIFICIAL INTELLIGENCE Microsoft’s Updated DeepSpeed Can Train Trillion-Parameter AI Models With Fewer GPUs Kyle Wiggers | TechCrunch “The enhanced DeepSpeed leverages three techniques to enable ‘trillion-scale’ model training: data parallel training, model parallel training, and pipeline parallel training. …Thanks to these and other performance improvements, Microsoft says a trillion-parameter model could be scaled across as […]